计算机与现代化 ›› 2013, Vol. 1 ›› Issue (4): 1-4.doi: 10.3969/j.issn.1006-2475.2013.04.001

• 人工智能 •    下一篇

基于话题模型的视频动作识别系统研究

施 惟1,2   

  1. 1.上海交通大学智能计算与智能系统教育部-微软重点实验室,上海 200240;2.上海交通大学计算机科学与工程系,上海 200240
  • 收稿日期:2012-12-06 修回日期:1900-01-01 出版日期:2013-04-17 发布日期:2013-04-17

Human Action Recognition System Based on Topic Model

SHI Wei1,2   

  1. 1. MOE-Microsoft Key Laboratory for Intelligent Computing and Intelligent Systems, Shanghai Jiaotong University, Shanghai 200240, China;2. Department of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200240, China
  • Received:2012-12-06 Revised:1900-01-01 Online:2013-04-17 Published:2013-04-17

摘要: 从视频中识别人体动作是目前计算机视觉领域一个具有挑战性的方向。本文采用文本处理领域的bag-of-words方法,将视频表示为文章。在视频中寻找局部区域内在时间与空间上变化最大的点,作为时空兴趣点,在兴趣点上采集的视觉特征,作为文章中的词汇。在此基础上引入主题模型,对于视频中的隐含主题进行分析。最终通过主题在视频中的分布,经过判别法则识别其中的人物动作。通过在公开的视觉数据集上进行测试,结果表明本方法的表现接近或超过目前国际上领先的方法。

关键词: 人物动作识别, 时空兴趣点, bag-of-words模型, 主题模型

Abstract: Human action recognition from video sequences is a challenging problem in computer vision. This paper uses the bag-of-words paradigm inherited from text analysis to represent a clip of video as a document. The local features are extracted from spatio-temporal interest points which are points with local maximum variation in both space and time domain. Then topic models on video documents are applied to analyze the latent topics and actions in the video are recognized in a discriminative fashion. The proposed system is tested on both simple and complex data sets. Experiment result shows that the approach is comparable or better than all published state-of-the-art methods.

Key words: human action recognition, spatio-temporal interest point, bag-of-words model, topic model

中图分类号: